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INDUCING CELLULAR IMMUNE RESPONSES TO HUMAN 
IMMUNODEFICIENCY VIRUS-1 USING PEPTIDE AND NUCLEIC ACID 

COMPOSITIONS 



1 0 CROSS-REFERENCE TO RELATED APPLICATIONS 

The present application claims priority to U.S. Application No. 09/412,863 filed 
October 5, 1999, which is herein incorporated by reference. 

FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT 

1 5 This invention was funded, in part, by the United States government under grants 

with the National Institutes of Health. The U.S. government has certain rights in this 
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I. BACKGROUND OF THE INVENTION 

Acquired immunodeficiency syndrome (AIDS) caused by infection with human 
immunodeficiency virus- 1 (HIV-1) represents a major world health problem. Estimates 
indicate that about 16,000 people worldwide are infected with HIV each day. 
5 The development of anti-viral drugs has been a major advancement in reducing 

viral loads in HIV infected patients. Highly active retroviral therapy (HAART) has been 
shown to reduce viremia to nearly undetectable levels. However, current drug therapies 
are not practicable as a long term solution to the HIV epidemic. HAART therapy is 
severely limited due to poor tolerance for the drugs and the emergence of drug-resistant 

10 virus. Moreover, replication competent HIV persists in the lymphoid tissue of patients 

who have responded to HAART, thus serving as a reservoir of virus. Lastly, current anti- 
retroviral drug therapies have little impact upon the global epidemic: almost 90% of the 
world's HIV infected population resides within countries lacking financial resources for 
these drugs. Thus, a need exists for an efficacious vaccine to both prevent and treat HIV 

15 infection. 

Virus-specific, human leukocyte antigen (HLA) class I-restricted cytotoxic T 
lymphocytes (CTL) are known to play a major role in the prevention and clearance of 
virus infections in vivo (Oldstone et al. 9 Nature 321 :239, 1989; Jamieson et ah, J. Virol. 
61:3930, 1987; Yap et al, Nature 273:238, 1978; Lukacher et ah, J. Exp. Med. 160:814, 

20 1994; McMichael et al., N. Engl. J. Med. 309:13, 1983; Sethi et ah, J. Gen. Virol. 64:443, 
1983; Watari et al., J. Exp. Med. 165:459, 1987; Yasukawa et al., J. Immunol. 143:2051, 
1989; Tigges et al., J. Virol. 66:1622, 1993; Reddenhase et al., J. Virol. 55:263, 1985; 
Quinnan et al., N. Engl. J. Med. 307:6, 1982). HLA class I molecules are expressed on 
the surface of almost all nucleated cells. Following intracellular processing of antigens, 

25 epitopes from the antigens are presented as a complex with the HLA class I molecules on 
the surface of such cells. CTL recognize the peptide-HLA class I complex, which then 
results in the destruction of the cell bearing the HLA-peptide complex directly by the 
CTL and/or via the activation of non-destructive mechanisms e.g., the production of 
interferon, that inhibit viral replication. 

30 While immune correlates of protective immunity against HIV infection are not 

well defined, there is a growing body of evidence that suggests CTL are important in 
controlling HIV infection. HIV-specific CTL responses can be detected early in infection 
and the appearance of the responses corresponds to the time in infection at which initial 
viremia is reduced (Pantaleo et al, Nature 370:463, 1994; Walker et aL, Proc. Natl 
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Acad. Sci. 86:9514, 1989). In addition, HIV replication in infected lymphocytes can be 
inhibited by incubation with autologous CTL (see, e.g., Tsubota et al, J. Exp. Med. 
169:1421, 1989). These data are supported by recent studies that indicate CTL are 
required for controlling viral replication in a SIV/rhesus animal model (Schmitz et al, 
5 Science 283:857, 1999), and additionally supported by studies that demonstrate that CTL 
exert selective pressure on HIV populations as evidenced by the eventual predominance 
of viruses with amino acid replacements in those regions of the virus to which CTL 
responses are directed (see, e.g., Borrow et al, Nature Med. 3:205-21 1, 1997; Price et al., 
Proc. Nat Acad. Sci. 94:12890-1895, 1997; Koenig et al, Nature Med. 1:330-336, 1995; 

1 0 and Haas et al , J. Immunol 1 57:42 1 2-422 1 , 1 996) 

Virus-specific T helper lymphocytes are also known to be critical for maintaining 
effective immunity in chronic viral infections. Historically, HTL responses were viewed 
as primarily supporting the expansion of specific CTL and B cell populations; however, 
more recent data indicate that HTL may directly contribute to the control of virus 

1 5 replication. For example, a decline in CD4 + T cells and a corresponding loss in HTL 

function characterize infection with HIV (Lane et al, New Engl J. Med. 313:79, 1985). 
Furthermore, studies in HIV infected patients have also shown that there is an inverse 
relationship between virus-specific HTL responses and viral load, suggesting that HTL 
play a role in viremia (see, e.g., Rosenberg et al, Science 278:1447, 1997). 

20 A fundamental challenge in the development of an efficacious HIV vaccine is the 

heterogeneity observed in HIV. The virus, like other retroviruses, rapidly mutates during 
replication resulting in the generation of virus that can escape anti-viral therapy and 
immune recognition (Borrow et al., Nature Med. 3:205, 1997). In addition, HIV can be 
classified into a variety of subtypes that exhibit significant sequence divergence (see, e.g., 

25 Lukashov et al, AIDS 12:S43, 1998). In view of the heterogeneous nature of HIV, and 
the heterogeneous immune response observed with HIV infection, induction of a multi- 
specific cellular immune response directed simultaneously against multiple HIV epitopes 
appears to be important for the development of an efficacious vaccine against HIV. 
There is a need to establish such vaccine embodiments which elicit immune responses of 

30 sufficient breadth and vigor to prevent and/or clear HIV infection. 

The epitope approach, as we have described, may represent a solution to this 
challenge, in that it allows the incorporation of various antibody, CTL and HTL epitopes, 
from various proteins, in a single vaccine compositions. Such a composition may 
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simultaneously target multiple dominant and subdominant epitopes and thereby be used 
to achieve effective immunization in a diverse population. 

The information provided in this section is intended to disclose the presently 
understood state of the art as of the filing date of the present application. Information is 
5 included in this section which was generated subsequent to the priority date of this 
application. Accordingly, information in this section is not intended, in any way, to 
delineate the priority date for the invention. 



II. SUMMARY OF THE INVENTION 

10 This invention applies our knowledge of the mechanisms by which antigen is 

recognized by T cells, for example, to develop epitope-based vaccines directed towards 
HIV. More specifically, this application communicates our discovery of specific epitope 
pharmaceutical compositions and methods of use in the prevention and treatment of HIV 
infection. 

15 Upon development of appropriate technology, the use of epitope-based vaccines 

has several advantages over current vaccines, particularly when compared to the use of 
whole antigens in vaccine compositions. There is evidence that the immune response to 
whole antigens is directed largely toward variable regions of the antigen, allowing for 
immune escape due to mutations. The epitopes for inclusion in an epitope-based vaccine 

20 may be selected from conserved regions of viral or tumor-associated antigens, which 
thereby reduces the likelihood of escape mutants. Furthermore, immunosuppressive 
epitopes that may be present in whole antigens can be avoided with the use of epitope- 
based vaccines. 

An additional advantage of an epitope-based vaccine approach is the ability to 
25 combine selected epitopes (CTL and HTL), and further, to modify the composition of the 
epitopes, achieving, for example, enhanced immunogenicity. Accordingly, the immune 
response can be modulated, as appropriate, for the target disease. Similar engineering of 
the response is not possible with traditional approaches. 

Another major benefit of epitope-based immune-stimulating vaccines is their 
30 safety. The possible pathological side effects caused by infectious agents or whole 

protein antigens, which might have their own intrinsic biological activity, is eliminated. 

An epitope-based vaccine also provides the ability to direct and focus an immune 
response to multiple selected antigens from the same pathogen. Thus, patient-by-patient 
variability in the immune response to a particular pathogen may be alleviated by inclusion 
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of epitopes from multiple antigens from the pathogen in a vaccine composition. In the 
case of HIV, epitopes derived from multiple strains may also be included. A "pathogen" 
may be an infectious agent or a tumor associated molecule. 

One of the most formidable obstacles to the development of broadly efficacious 
5 epitope-based immunotherapeutics, however, has been the extreme polymorphism of 
HLA molecules. To date, effective non-genetically biased coverage of a population has 
been a task of considerable complexity; such coverage has required that epitopes be used 
that are specific for HLA molecules corresponding to each individual HLA allele. 
Impractically large numbers of epitopes would therefore have to be used in order to cover 
10 ethnically diverse populations. Thus, there has existed a need for peptide epitopes that 
are bound by multiple HLA antigen molecules for use in epitope-based vaccines. The 
greater the number of HLA antigen molecules bound, the greater the breadth of 
population coverage by the vaccine. 

Furthermore, as described herein in greater detail, a need has existed to modulate 
15 peptide binding properties, e.g., so that peptides that are able to bind to multiple HLA 
antigens do so with an affinity that will stimulate an immune response. Identification of 
epitopes restricted by more than one HLA allele at an affinity that correlates with 
immunogenicity is important to provide thorough population coverage, and to allow the 
elicitation of responses of sufficient vigor to prevent or clear an infection in a diverse 
20 segment of the population. Such a response can also target a broad array of epitopes. 
The technology disclosed herein provides for such favored immune responses. 

In a preferred embodiment, epitopes for inclusion in vaccine compositions of the 
invention are selected by a process whereby protein sequences of known antigens are 
evaluated for the presence of motif or supermotif-bearing epitopes. Peptides 
25 corresponding to a motif- or supermotif-bearing epitope are then synthesized and tested 
for the ability to bind to the HLA molecule that recognizes the selected motif. Those 
peptides that bind at an intermediate or high affinity i.e., an IC50 (or a Kd value) of 500 
nM or less for HLA class I molecules or an IC50 of 1000 nM or less for HLA class II 
molecules, are further evaluated for their ability to induce a CTL or HTL response. 
30 Immunogenic peptide epitopes are selected for inclusion in vaccine compositions. 

Supermotif-bearing peptides may additionally be tested for the ability to bind to 
multiple alleles within the HLA supertype family. Moreover, peptide epitopes may be 
analogued to modify binding affinity and/or the ability to bind to multiple alleles within 
an HLA supertype. 
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The invention also includes embodiments comprising methods for monitoring or 
evaluating an immune response to HIV in a patient having a known HLA-type. Such 
methods comprise incubating a T lymphocyte sample from the patient with a peptide 
composition comprising an HIV epitope that has an amino acid sequence described in 
5 Tables VII to Table XX which binds the product of at least one HLA allele present in the 
patient, and detecting for the presence of a T lymphocyte that binds to the peptide. A 
CTL peptide epitope may, for example, be used as a component of a tetrameric complex 
for this type of analysis. 

An alternative modality for defining the peptide epitopes in accordance with the 
10 invention is to recite the physical properties, such as length; primary structure; or charge, 
which are correlated with binding to a particular allele-specific HLA molecule or group 
of allele-specific HLA molecules. A further modality for defining peptide epitopes is to 
recite the physical properties of an HLA binding pocket, or properties shared by several 
allele-specific HLA binding pockets (e.g. pocket configuration and charge distribution) 
1 5 and reciting that the peptide epitope fits and binds to the pocket or pockets. 

As will be apparent from the discussion below, other methods and embodiments 
are also contemplated. Further, novel synthetic peptides produced by any of the methods 
described herein are also part of the invention. 



20 III. BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 : Figure 1 provides a graph of total frequency of genotypes as a function 
of the number of PF candidate epitopes bound by HLA- A and B molecules, in an average 
population. 

Figure 2: Figure 2 illustrates the position of peptide epitopes in an experimental 
25 model minigene construct. 

IV. DETAILED DESCRIPTION OF THE INVENTION 

The peptide epitopes and corresponding nucleic acid compositions of the present 
invention are useful for stimulating an immune response to HIV by stimulating the 
30 production of CTL or HTL responses. The peptide epitopes, which are derived directly or 
indirectly from native HIV protein amino acid sequences, are able to bind to HLA 
molecules and stimulate an immune response to HIV. The complete sequence of the HIV 
proteins to be analyzed can be obtained from Genbank. Peptide epitopes and analogs 
thereof can also be readily determined from sequence information that may subsequently 
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be discovered for heretofore unknown variants of HIV, as will be clear from the 
disclosure provided below. 

The peptide epitopes of the invention have been identified in a number of ways, as 
will be discussed below. Also discussed in greater detail is that analog peptides have 
5 been derived and the binding activity for HLA molecules modulated by modifying 

specific amino acid residues to create peptide analogs exhibiting altered immunogenicity. 
Further, the present invention provides compositions and combinations of compositions 
that enable epitope-based vaccines that are capable of interacting with HLA molecules 
encoded by various genetic alleles to provide broader population coverage than prior 
10 vaccines. 



IV.A. Definitions 

The invention can be better understood with reference to the following definitions, 
which are listed alphabetically: 
15 A "computer" or "computer system" generally includes: a processor; at least one 

information storage/retrieval apparatus such as, for example, a hard drive, a disk drive or 
a tape drive; at least one input apparatus such as, for example, a keyboard, a mouse, a 
touch screen, or a microphone; and display structure. Additionally, the computer may 
include a communication channel in communication with a network. Such a computer 
20 may include more or less than what is listed above. 

A "construct" as used herein generally denotes a composition that does not occur 
in nature. A construct can be produced by synthetic technologies, e.g., recombinant DNA 
preparation and expression or chemical synthetic techniques for nucleic or amino acids. 
A construct can also be produced by the addition or affiliation of one material with 
25 another such that the result is not found in nature in that form. 

"Cross-reactive binding" indicates that a peptide is bound by more than one HLA 
molecule; a synonym is degenerate binding. 

A "cryptic epitope" elicits a response by immunization with an isolated peptide, 
but the response is not cross-reactive in vitro when intact whole protein which comprises 
30 the epitope is used as an antigen. 

A "dominant epitope" is an epitope that induces an immune response upon 
immunization with a whole native antigen (see, e.g., Sercarz, et al. 9 Annu. Rev. Immunol. 
1 1 : 729-766, 1993). Such a response is cross-reactive in vitro with an isolated peptide 
epitope. 
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With regard to a particular amino acid sequence, an "epitope" is a set of amino 
acid residues which is involved in recognition by a particular immunoglobulin, or in the 
context of T cells, those residues necessary for recognition by T cell receptor proteins 
and/or Major Histocompatibility Complex (MHC) receptors. In an immune system 
5 setting, in vivo or in vitro, an epitope is the collective features of a molecule, such as 
primary, secondary and tertiary peptide structure, and charge, that together form a site 
recognized by an immunoglobulin, T cell receptor or HLA molecule. Throughout this 
disclosure epitope and peptide are often used interchangeably. It is to be appreciated, 
however, that isolated or purified protein or peptide molecules larger than and comprising 

10 an epitope of the invention are still within the bounds of the invention. 

It is to be appreciated that protein or peptide molecules that comprise an epitope 
of the invention as well as additional amino acid(s) are still within the bounds of the 
invention. In certain embodiments, there is a limitation on the length of a peptide of the 
invention which is not otherwise a construct. An embodiment that is length-limited 

15 occurs when the protein/peptide comprising an epitope of the invention comprises a 
region (i.e., a contiguous series of amino acids) having 100% identity with a native 
sequence. In order to avoid the definition of epitope from reading, e.g., on whole natural 
molecules, there is a limitation on the length of any region that has 100% identity with a 
native peptide sequence. Thus, for a peptide comprising an epitope of the invention and a 

20 region with 100% identity with a native peptide sequence (and is not otherwise a 

construct), the region with 100% identity to a native sequence generally has a length of: 
less than or equal to 600 amino acids, often less than or equal to 500 amino acids, often 
less than or equal to 400 amino acids, often less than or equal to 250 amino acids, often 
less than or equal to 100 amino acids, often less than or equal to 85 amino acids, often 

25 less than or equal to 75 amino acids, often less than or equal to 65 amino acids, and often 
less than or equal to 50 amino acids. In certain embodiments, an "epitope" of the 
invention is comprised by a peptide having a region with less than 51 amino acids that has 
100% identity to a native peptide sequence, in any increment of (49, 48, 47, 46, 45, 44, 
43, 42, 41, 40, 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 

30 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5) down to 5 amino acids. 

Accordingly, peptide or protein sequences longer than 600 amino acids are within 
the scope of the invention, so long as they do not comprise any contiguous sequence of 
more than 600 amino acids that have 100% identity with a native peptide sequence, if 
they are not otherwise a construct. For any peptide that has five contiguous residues or 
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less that correspond to a native sequence, there is no limitation on the maximal length of 
that peptide in order to fall within the scope of the invention. It is presently preferred that 
a CTL epitope be less than 600 residues long in any increment down to eight amino acid 
residues. 

5 "Human Leukocyte Antigen" or "HLA" is a human class I or class II Major 

Histocompatibility Complex (MHC) protein {see, e.g., Stites, et al, Immunology, 8 th 
Ed., Lange Publishing, Los Altos, CA (1994). 

An "HLA supertype or family", as used herein, describes sets of HLA molecules 
grouped on the basis of shared pep tide-binding specificities. HLA class I molecules that 
10 share somewhat similar binding affinity for peptides bearing certain amino acid motifs are 
grouped into HLA supertypes. The terms HLA superfamily, HLA supertype family, 
HLA family, and HLA xx-like molecules (where xx denotes a particular HLA type), are 
synonyms. 

Throughout this disclosure, results are expressed in terms of "ICso's." IC 5 o is the 

15 concentration of peptide in a binding assay at which 50% inhibition of binding of a 
reference peptide is observed. Given the conditions in which the assays are run (i.e., 
limiting HLA proteins and labeled peptide concentrations), these values approximate K D 
values. Assays for determining binding are described in detail, e.g., in PCT publications 
WO 94/20127 and WO 94/03205. It should be noted that IC 5 o values can change, often 

20 dramatically, if the assay conditions are varied, and depending on the particular reagents 
used (e.g., HLA preparation, etc.). For example, excessive concentrations of HLA 
molecules will increase the apparent measured IC50 of a given ligand. 

Alternatively, binding is expressed relative to a reference peptide. Although as a 
particular assay becomes more, or less, sensitive, the ICso's of the peptides tested may 

25 change somewhat, the binding relative to the reference peptide will not significantly 

change. For example, in an assay run under conditions such that the IC50 of the reference 
peptide increases 10-fold, the IC50 values of the test peptides will also shift approximately 
10-fold. Therefore, to avoid ambiguities, the assessment of whether a peptide is a good, 
intermediate, weak, or negative binder is generally based on its IC 50 , relative to the IC50 

30 of a standard peptide. 

Binding may also be determined using other assay systems including those using: 
live cells (e.g., Ceppellini et al, Nature 339:392, 1989; Christnick et al, Nature 352:67, 
1991; Busch et al, Int. Immunol 2:443, 19990; Hill et al, J. Immunol 147:189, 1991; del 
Guercio et al,J. Immunol. 154:685, 1995), cell free systems using detergent lysates (e.g., 
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Cerundolo et aL, J. Immunol 21 :2069, 1991), immobilized purified MHC (e.g., Hill et 
aL, J. Immunol 152, 2890, 1994; Marshall et aL, J. Immunol 152:4946, 1994), ELISA 
systems (e.g., Reay et al,EMBO J. 1 1 :2829, 1992), surface plasmon resonance (eg., 
Khilko et al,J. Biol Chem. 268:15425, 1993); high flux soluble phase assays (Hammer 
5 et aL, J. Exp. Med. 180:2353, 1994), and measurement of class I MHC stabilization or 
assembly (e.g., Ljunggren et aL, Nature 346:476, 1990; Schumacher et aL, Cell 62:563, 
1990; Townsend et aL, Cell 62:285, 1990; Parker et aL, J. Immunol 149:1896, 1992). 

As used herein, "high affinity" with respect to HLA class I molecules is defined as 
binding with an IC 5 o, or K D value, of 50 nM or less; "intermediate affinity" is binding 

10 with an IC 50 or K D value of between about 50 and about 500 nM. "High affinity" with 
respect to binding to HLA class II molecules is defined as binding with an IC50 or Kd 
value of 100 nM or less; "intermediate affinity" is binding with an IC50 or K D value of 
between about 100 and about 1000 nM. 

The terms "identical" or percent "identity," in the context of two or more peptide 

1 5 sequences, refer to two or more sequences or subsequences that are the same ordiave a 
specified percentage of amino acid residues that are the same, when compared and 
aligned for maximum correspondence over a comparison window, as measured using a 
sequence comparison algorithm or by manual alignment and visual inspection. 

An "immunogenic peptide" or "peptide epitope" is a peptide that comprises an 

20 allele-specific motif or supermotif such that the peptide will bind an HLA molecule and 
induce a CTL and/or HTL response. Thus, immunogenic peptides of the invention are 
capable of binding to an appropriate HLA molecule and thereafter inducing a cytotoxic T 
cell response, or a helper T cell response, to the antigen from which the immunogenic 
peptide is derived. 

25 The phrases "isolated" or "biologically pure" refer to material which is 

substantially or essentially free from components which normally accompany the material 
as it is found in its native state. Thus, isolated peptides in accordance with the invention 
preferably do not contain materials normally associated with the peptides in their in situ 
environment. 

30 "Link" or "join" refers to any method known in the art for functionally connecting 

peptides, including, without limitation, recombinant fusion, covalent bonding, disulfide 
bonding, ionic bonding, hydrogen bonding, and electrostatic bonding. 

"Major Histocompatibility Complex" or "MHC" is a cluster of genes that plays a 
role in control of the cellular interactions responsible for physiologic immune responses. 
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In humans, the MHC complex is also known as the HLA complex. For a detailed 
description of the MHC and HLA complexes, see, Paul, Fundamental Immunology, 
3 rd Ed., Raven Press, New York, 1993. 

The term "motif refers to the pattern of residues in a peptide of defined length, 
5 usually a peptide of from about 8 to about 13 amino acids for a class I HLA motif and 
from about 6 to about 25 amino acids for a class II HLA motif, which is recognized by a 
particular HLA molecule. Peptide motifs are typically different for each protein encoded 
by each human HLA allele and differ in the pattern of the primary and secondary anchor 
residues. 

10 A "negative binding residue" or "deleterious residue" is an amino acid which, if 

present at certain positions (typically not primary anchor positions) in a peptide epitope, 
results in decreased binding affinity of the peptide for the peptide's corresponding HLA 
molecule. 

A "non-native" sequence or "construct" refers to a sequence that is not found in 

15 nature, i.e., is "non-naturally occurring". Such sequences include, e.g., peptides that are 
lipidated or otherwise modified, and polyepitopic compositions that contain epitopes that 
are not contiguous in a native protein sequence. 

The term "peptide" is used interchangeably with "oligopeptide" in the present 
specification to designate a series of residues, typically L-amino acids, connected one to 

20 the other, typically by peptide bonds between the a-amino and carboxyl groups of 
adjacent amino acids. The preferred CTL-inducing peptides of the invention are 13 
residues or less in length and usually consist of between about 8 and about 1 1 residues, 
preferably 9 or 10 residues. The preferred HTL-inducing oligopeptides are less than 
about 50 residues in length and usually consist of between about 6 and about 30 residues, 

25 more usually between about 12 and 25, and often between about 15 and 20 residues. 

"Pharmaceutically acceptable" refers to a generally non-toxic, inert, and/or 
physiologically compatible composition. 

A "primary anchor residue" is an amino acid at a specific position along a peptide 
sequence which is understood to provide a contact point between the immunogenic 

30 peptide and the HLA molecule. One to three, usually two, primary anchor residues 
within a peptide of defined length generally defines a "motif for an immunogenic 
peptide. These residues are understood to fit in close contact with peptide binding 
grooves of an HLA molecule, with their side chains buried in specific pockets of the 
binding grooves themselves. In one embodiment, for example, the primary anchor 
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residues are located at position 2 (from the amino terminal position) and at the carboxyl 
terminal position of a 9-residue peptide epitope in accordance with the invention. The 
primary anchor positions for each motif and supermotif are set forth in Table 1 . For 
example, analog peptides can be created by altering the presence or absence of particular 
5 residues in these primary anchor positions. Such analogs are used to modulate the 
binding affinity of a peptide comprising a particular motif or supermotif. 

"Promiscuous recognition" is where a distinct peptide is recognized by the same T 
cell clone in the context of various HLA molecules. Promiscuous recognition or binding 
is synonymous with cross-reactive binding. 

10 A "protective immune response" or "therapeutic immune response" refers to a 

CTL and/or an HTL response to an antigen derived from an infectious agent or a tumor 
antigen, which prevents or at least partially arrests disease symptoms or progression. The 
immune response may also include an antibody response which has been facilitated by 
the stimulation of helper T cells. 

15 The term "residue" refers to an amino acid or amino acid mimetic incorporated 

into an oligopeptide by an amide bond or amide bond mimetic. 

A "secondary anchor residue" is an amino acid at a position other than a primary 
anchor position in a peptide which may influence peptide binding. A secondary anchor 
residue occurs at a significantly higher frequency amongst bound peptides than would be 

20 expected by random distribution of amino acids at one position. The secondary anchor 
residues are said to occur at "secondary anchor positions." A secondary anchor residue 
can be identified as a residue which is present at a higher frequency among high or 
intermediate affinity binding peptides, or a residue otherwise associated with high or 
intermediate affinity binding. For example, analog peptides can be created by altering the 

25 presence or absence of particular residues in these secondary anchor positions. Such 
analogs are used to finely modulate the binding affinity of a peptide comprising a 
particular motif or supermotif. 

A "subdominant epitope" is an epitope which evokes little or no response upon 
immunization with whole antigens which comprise the epitope, but for which a response 

30 can be obtained by immunization with an isolated peptide, and this response (unlike the 
case of cryptic epitopes) is detected when whole protein is used to recall the response in 
vitro or in vivo. 
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A "superrnotif ' is a peptide binding specificity shared by HLA molecules encoded 
by two or more HLA alleles. Preferably, a supermotif-bearing peptide is recognized with 
high or intermediate affinity (as defined herein) by two or more HLA antigens. 

"Synthetic peptide" refers to a peptide that is man-made using such methods as 
5 chemical synthesis or recombinant DNA technology. 

As used herein, a "vaccine" is a composition that contains one or more peptides of 
the invention. There are numerous embodiments of vaccines in accordance with the 
invention, such as by a cocktail of one or more peptides; one or more epitopes of the 
invention comprised by a polyepitopic peptide; or nucleic acids that encode such peptides 

10 or polypeptides, e.g., a minigene that encodes a polyepitopic peptide. The "one or more 
peptides" can include any whole unit integer from 1-150, e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, 
10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20,21,22, 23,24,25, 26, 27, 28, 29, 30,31,32, 33, 
34, 35, 36, 37, 38, 39, 40 , 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 
120, 125, 130, 135, 140, 145, or 150 or more peptides of the invention. The peptides or 

15 polypeptides can optionally be modified, such as by lipidation, addition of targeting or 
other sequences. HLA class I-binding peptides of the invention can be admixed with, or 
linked to, HLA class II-binding peptides, to facilitate activation of both cytotoxic T 
lymphocytes and helper T lymphocytes. Vaccines can also comprise peptide-pulsed 
antigen presenting cells, e.g., dendritic cells. 

20 The nomenclature used to describe peptide compounds follows the conventional 

practice wherein the amino group is presented to the left (the N-terminus) and the 
carboxyl group to the right (the C-terminus) of each amino acid residue. When amino 
acid residue positions are referred to in a peptide epitope they are numbered in an amino 
to carboxyl direction with position one being the position closest to the amino terminal 

25 end of the epitope, or the peptide or protein of which it may be a part. In the formulae 
representing selected specific embodiments of the present invention, the amino- and 
carboxyl-terminal groups, although not specifically shown, are in the form they would 
assume at physiologic pH values, unless otherwise specified. In the amino acid structure 
formulae, each residue is generally represented by standard three letter or single letter 

30 designations. The L-form of an amino acid residue is represented by a capital single letter 
or a capital first letter of a three-letter symbol, and the D-form for those amino acids 
having D-forms is represented by a lower case single letter or a lower case three letter 
symbol. Glycine has no asymmetric carbon atom and is simply referred to as "Gly" or G. 
Symbols for the amino acids are shown below. 
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Single Letter Symbol Three Letter Symbol Amino Acids 



A 


Ala 


Alanine 


C 


Cys 


Cysteine 


D 


Asp 


Aspartic Acid 


E 


Glu 


Glutamic Acid 


F 


Phe 


Phenylalanine 


G 


Gly 


Glycine 


H 


His 


Histidine 


I 


He 


Isoleucine 


K 


Lys 


Lysine 


L 


Leu 


Leucine 


M 


Met 


Methionine 


N 


Asn 


Asparagine 


P 


Pro 


Proline 


Q 


Gin 


Glutamine 


R 


Arg 


Arginine 


S 


Ser 


Serine 


T 


Thr 


Threonine 


V 


Val 


Valine 


w 


Trp 


Tryptophan 


Y 


Tyr 


Tyrosine 



IV.B. Stimulation of CTL and HTL responses 

5 The mechanism by which T cells recognize antigens has been delineated during 

the past ten years. Based on our understanding of the immune system we have developed 
efficacious peptide epitope vaccine compositions that can induce a therapeutic or 
prophylactic immune response to HIV in a broad population. For an understanding of the 
value and efficacy of the claimed compositions, a brief review of immunology-related 
1 0 technology is provided. 

A complex of an HLA molecule and a peptidic antigen acts as the ligand 
recognized by HLA-restricted T cells (Buus, S. et al. 9 Cell 47:1071, 1986; Babbitt, B. P. 
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et al, Nature 317:359, 1985; Townsend, A. and Bodmer, H., Annu. Rev. Immunol 7:601, 
1989; Germain, R. N., Annu. Rev. Immunol 1 1 :403, 1993). Through the study of single 
amino acid substituted antigen analogs and the sequencing of endogenously bound, 
naturally processed peptides, critical residues that correspond to motifs required for 
5 specific binding to HLA antigen molecules have been identified and are described herein 
and are set forth in Tables I, II, and III (see also, e.g., Southwood, et al, J. Immunol 
160:3363, 1998; Rammensee, et al, Immunogenetics 41:178, 1995; Rammensee et al, 
SYFPEITHI, access via web at : http://134.2,96.221/scripts.hlaserver.dll/home.htm; Sette, 
A. and Sidney, J. Curr. Opin. Immunol. 10:478, 1998; Engelhard, V. H., Curr. Opin. 

10 Immunol 6:13, 1994; Sette, A. and Grey, H. M., Curr. Opin. Immunol 4:79, 1992; 

Sinigaglia, F. and Hammer, J. Curr. Biol 6:52, 1994; Ruppert etal, Cell 74:929-937, 
1993; Kondo etal, J. Immunol 155:4307-4312, 1995; Sidney et al, J. Immunol 
157:3480-3490, 1996; Sidney et al, Human Immunol. 45:79-93, 1996; Sette, A. and 
Sidney, J. Immunogenetics, in press, 1999). 

15 Furthermore, x-ray crystallographic analysis of HLA-peptide complexes has 

revealed pockets within the peptide binding cleft of HLA molecules which accommodate, 
in an allele-specific mode, residues borne by peptide ligands; these residues in turn 
determine the HLA binding capacity of the peptides in which they are present. (See, e.g., 
Madden, D.R. Annu. Rev. Immunol 13:587, 1995; Smith, et al, Immunity 4:203, 1996; 

20 Fremont et al, Immunity 8:305, 1998; Stern et al, Structure 2:245, 1994; Jones, E.Y. 

Curr. Opin. Immunol. 9:75, 1997; Brown, J. H. et al, Nature 364:33, 1993; Guo, H. C. et 
al, Proc. Natl Acad. Sci. USA 90:8053, 1993; Guo, H. C. et al, Nature 360:364, 1992; 
Silver, M. L. et al, Nature 360:367, 1992; Matsumura, M. et al, Science 257:927, 1992; 
Madden et al, Cell 70:1035, 1992; Fremont, D. H. et al, Science 257:919, 1992; Saper, 

25 M. A. , Bjorkman, P. J. and Wiley, D. C, J. Mol. Biol 219:277, 1991 .) 

Accordingly, the definition of class I and class II allele-specific HLA binding 
motifs, or class I or class II supermotifs allows identification of regions within a protein 
that have the potential of binding particular HLA antigen(s). 

The present inventors have found that the correlation of binding affinity with 

30 immunogenicity, which is disclosed herein, is an important factor to be considered when 
evaluating candidate peptides. Thus, by a combination of motif searches and HLA- 
peptide binding assays, candidates for epitope-based vaccines have been identified. After 
determining their binding affinity, additional confirmatory work can be performed to 



WO 01/24810 PCT/US00/27766 

17 

select, amongst these vaccine candidates, epitopes with preferred characteristics in terms 
of population coverage, antigenicity, and immunogenicity. 

Various strategies can be utilized to evaluate immunogenicity, including: 

1) Evaluation of primary T cell cultures from normal individuals (see, e.g., 

5 Wentworth, P. A. et aL, MoL Immunol 32:603, 1995; Celis, E. et aL, Proc. Natl. Acad. 
Set USA 91:2105, 1994; Tsai, V. et aL, J. Immunol 158:1796, 1997; Kawashima, I. et 
al, Human Immunol. 59:1, 1998); This procedure involves the stimulation of peripheral 
blood lymphocytes (PBL) from normal subjects with a test peptide in the presence of 
antigen presenting cells in vitro over a period of several weeks. T cells specific for the 

10 peptide become activated during this time and are detected using, e.g., a ^lCr-release 
assay involving peptide sensitized target cells. 

2) Immunization of HLA transgenic mice {see, e.g., Wentworth, P. A. et aL, J. 
Immunol 26:97, 1996; Wentworth, P. A. et al,Int. Immunol 8:651, 1996; Alexander, J. 
et aL, J. Immunol. 159:4753, 1997); In this method, peptides in incomplete Freund's 

15 adjuvant are administered subcutaneously to HLA transgenic mice. Several weeks 

following immunization, splenocytes are removed and cultured in vitro in the presence of 
test peptide for approximately one week. Peptide-specific T cells are detected using, e.g., 
a ^ 1 Cr-release assay involving peptide sensitized target cells and target cells expressing 
endogenously generated antigen. 

20 3) Demonstration of recall T cell responses from immune individuals who have 

effectively been vaccinated, recovered from infection, and/or from chronically infected 
patients (see, e.g., Rehermann, B. et al.,J. Exp. Med. 181:1047, 1995; Doolan, D. L. et 
aL, Immunity 7:97, 1997; Bertoni, R. et aL, J. Clin. Invest. 100:503, 1997; Threlkeld, S. 
C. et al f J. Immunol 159:1648, 1997; Diepolder, H. M. et aL, J. Virol. 71:6011, 1997); 

25 In applying this strategy, recall responses are detected by culturing PBL from subjects 
that have been naturally exposed to the antigen, for instance through infection, and thus 
have generated an immune response "naturally", or from patients who were vaccinated 
against the infection. PBL from subjects are cultured in vitro for 1-2 weeks in the 
presence of test peptide plus antigen presenting cells (APC) to allow activation of 

30 "memory" T cells, as compared to "naive" T cells. At the end of the culture period, T cell 

activity is detected using assays for T cell activity including SlCr release involving 
peptide-sensitized targets, T cell proliferation, or lymphokine release. 
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The following describes the peptide epitopes and corresponding nucleic acids of 
the invention. 



IV.C. Binding Affinity of Peptide Epitopes for HLA Molecules 

5 As indicated herein, the large degree of HLA polymorphism is an important factor 

to be taken into account with the epitope-based approach to vaccine development. To 
address this factor, epitope selection encompassing identification of peptides capable of 
binding at high or intermediate affinity to multiple HLA molecules is preferably utilized, 
most preferably these epitopes bind at high or intermediate affinity to two or more allele- 

1 0 specific HLA molecules. 

CTL-inducing peptides of interest for vaccine compositions preferably include 
those that have an IC50 or binding affinity value for class I HLA molecules of 500 nM or 
better (i.e., the value is < 500 nM). HTL-inducing peptides preferably include those that 
have an IC50 or binding affinity value for class II HLA molecules of 1000 nM or better, 

15 {i.e., the value is < 1,000 nM). For example, peptide binding is assessed by testing the 
capacity of a candidate peptide to bind to a purified HLA molecule in vitro. Peptides 
exhibiting high or intermediate affinity are then considered for further analysis. Selected 
peptides are tested on other members of the supertype family. In preferred embodiments, 
peptides that exhibit cross-reactive binding are then used in cellular screening analyses or 

20 vaccines. 

As disclosed herein, higher HLA binding affinity is correlated with greater 
immunogenicity. Greater immunogenicity can be manifested in several different ways. 
Immunogenicity corresponds to whether an immune response is elicited at all, and to the 
vigor of any particular response, as well as to the extent of a population in which a 

25 response is elicited. For example, a peptide might elicit an immune response in a diverse 
array of the population, yet in no instance produce a vigorous response. In accordance 
with these principles, close to 90% of high binding peptides have been found to be 
immunogenic, as contrasted with about 50% of the peptides which bind with intermediate 
affinity. Moreover, higher binding affinity peptides lead to more vigorous immunogenic 

30 responses. As a result, less peptide is required to elicit a similar biological effect if a high 
affinity binding peptide is used. Thus, in preferred embodiments of the invention, high 
affinity binding epitopes are particularly useful. 
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The relationship between binding affinity for HLA class I molecules and 
immunogenicity of discrete peptide epitopes on bound antigens has been determined for 
the first time in the art by the present inventors. The correlation between binding affinity 
and immunogenicity was analyzed in two different experimental approaches {see, e.g., 
5 Sette, et al, J. Immunol 153:5586-5592, 1994). In the first approach, the 

immunogenicity of potential epitopes ranging in HLA binding affinity over a 10,000-fold 
range was analyzed in HLA-A*0201 transgenic mice. In the second approach, the 
antigenicity of approximately 100 different hepatitis B virus (HBV)-derived potential 
epitopes, all carrying A*0201 binding motifs, was assessed by using PBL from acute 

10 hepatitis patients. Pursuant to these approaches, it was determined that an affinity 
threshold value of approximately 500 nM (preferably 50 nM or less) determines the 
capacity of a peptide epitope to elicit a CTL response. These data are true for class I 
binding affinity measurements for naturally processed peptides and for synthesized T cell 
epitopes. These data also indicate the important role of determinant selection in the 

15 shaping of T cell responses {see, e.g., Schaeffer et al Proc. Natl. Acad. ScL USA 
86:4649-4653, 1989). 

An affinity threshold associated with immunogenicity in the context of HLA class 
II DR molecules has also been delineated {see, e.g., Southwood et al J. Immunology 
160:3363-3373,1998, and co-pending U.S.S.N. 09/009,953 filed 1/21/98). In order to 

20 define a biologically significant threshold of DR binding affinity, a database of the 

binding affinities of 32 DR-restricted epitopes for their restricting element {i.e., the HLA 
molecule that binds the motif) was compiled. In approximately half of the cases (15 of 32 
epitopes), DR restriction was associated with high binding affinities, i.e. binding affinity 
values of 100 nM or less. In the other half of the cases (16 of 32), DR restriction was 

25 associated with intermediate affinity (binding affinity values in the 100-1000 nM range). 
In only one of 32 cases was DR restriction associated with an IC50 of 1000 nM or greater. 
Thus, 1000 nM can be defined as an affinity threshold associated with immunogenicity in 
the context of DR molecules. 

The binding affinity of peptides for HLA molecules can be determined as 

30 described in Example 1, below. 



IV.D. Peptide Epitope Binding Motifs and Supermotifs 

Through the study of single amino acid substituted antigen analogs and the 
sequencing of endogenously bound, naturally processed peptides, critical residues 
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required for allele-specific binding to HLA molecules have been identified. The presence 
of these residues correlates with binding affinity for HLA molecules. The identification 
of motifs and/or supermotifs that correlate with high and intermediate affinity binding is 
an important issue with respect to the identification of immunogenic peptide epitopes for 
5 the inclusion in a vaccine. Kast et al. (J. Immunol. 152:3904-3912, 1994) have shown 
that motif-bearing peptides account for 90% of the epitopes that bind to allele-specific 
HLA class I molecules. In this study all possible peptides of 9 amino acids in length and 
overlapping by eight amino acids (240 peptides), which cover the entire sequence of the 
E6 and E7 proteins of human papillomavirus type 16, were evaluated for binding to five 

10 allele-specific HLA molecules that are expressed at high frequency among different 

ethnic groups. This unbiased set of peptides allowed an evaluation of the predictive value 
of HLA class I motifs. From the set of 240 peptides, 22 peptides were identified that 
bound to an allele-specific HLA molecule with high or intermediate affinity. Of these 22 
peptides, 20 (i.e. 91%) were motif-bearing. Thus, this study demonstrates the value of 

15 motifs for the identification of peptide epitopes for inclusion in a vaccine: application of 
motif-based identification techniques will identify about 90% of the potential epitopes in 
a target antigen protein sequence. 

Such peptide epitopes are identified in the Tables described below. 

Peptides of the present invention may also comprise epitopes that bind to MHC 

20 class II DR molecules. A greater degree of heterogeneity in both size and binding frame 
position of the motif, relative to the N and C termini of the peptide, exists for class II 
peptide ligands. This increased heterogeneity of HLA class II peptide ligands is due to 
the structure of the binding groove of the HLA class II molecule which, unlike its class I 
counterpart, is open at both ends. Crystallo graphic analysis of HLA class II DRB*0101- 

25 peptide complexes showed that the major energy of binding is contributed by peptide 
residues complexed with complementary pockets on the DRB*0101 molecules. An 
important anchor residue engages the deepest hydrophobic pocket (see, e.g., Madden, 
D.R. Ann. Rev. Immunol. 13:587, 1995) and is referred to as position 1 (PI). PI may 
represent the N-terminal residue of a class II binding peptide epitope, but more typically 

30 is flanked towards the N-terminus by one or more residues. Other studies have also 
pointed to an important role for the peptide residue in the 6 th position towards the C- 
terminus, relative to PI, for binding to various DR molecules. 

In the past few years evidence has accumulated to demonstrate that a large 
fraction of HLA class I and class II molecules can be classified into a relatively few 
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supertypes, each characterized by largely overlapping peptide binding repertoires, and 
consensus structures of the main peptide binding pockets. Thus, peptides of the present 
invention are identified by any one of several HLA-specific amino acid motifs (see, e.g., 
Tables I-III), or if the presence of the motif corresponds to the ability to bind several 
5 allele-specific HLA antigens, a supermotif The HLA molecules that bind to peptides that 
possess a particular amino acid supermotif are collectively referred to as an HLA 
"supertype." 

The peptide motifs and supermotifs described below, and summarized in Tables I- 
III, provide guidance for the identification and use of peptide epitopes in accordance with 

10 the invention. 

Examples of peptide epitopes bearing a respective supermotif or motif are 
included in Tables as designated in the description of each motif or supermotif below. 
The Tables include a binding affinity ratio listing for some of the peptide epitopes. The 
ratio may be converted to IC50 by using the following formula: IC50 of the standard 

15 peptide/ratio = IC50 of the test peptide (i.e., the peptide epitope). The IC50 values of 

standard peptides used to determine binding affinities for Class I peptides are shown in 
Table IV. The IC50 values of standard peptides used to determine binding affinities for 
Class II peptides are shown in Table V. The peptides used as standards for the binding 
assays described herein are examples of standards; alternative standard peptides can also 

20 be used when performing binding studies. 

To obtain the peptide epitope sequences listed in each Table, protein sequence 
data for all of the HIV-1 isolates present in the 1999 Los Alamos database (http://hiv- 
web.lanl.gov) were evaluated for the presence of the designated supermotif or motif. A 
listing of the strains is provided in Table XXVI. Nine HIV-1 structural and regulatory 

25 proteins, gag, pol, env, nef, rev, tat, vif, vpr, and vpu, were included in the analysis. 

Peptide epitopes were additionally evaluated on the basis of their conservancy (i.e., the 
amount of variance) among the available protein sequences for each HIV antigen. A 
criterion for conservancy used to generate the peptides set out in Tables VII-XX requires 
that the entire sequence of an HLA class I binding peptide be totally conserved in 15% of 

30 the sequences available for a specific HIV antigen. Similarly, a criterion for conservancy 
requires that the entire 9-mer core region of an HLA class II binding peptide be totally 
conserved in 15% of the sequences available for a specific protein. The percent 
conservancy of the selected peptide epitopes is indicated on the Tables. The frequency, 
i.e. the number of sequences of the HIV protein antigen in which the totally conserved 
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peptide sequence was identified, is also shown. The "pos" (position) column in the 
Tables designates the amino acid position in the HIV protein that corresponds to the first 
amino acid residue of the epitope. The "number of amino acids" indicates the number of 
residues in the epitope sequence. 

5 

HLA Class I Motifs Indicative of CTL Inducing Peptide Epitopes: 

The primary anchor residues of the HLA class I peptide epitope supermotifs and 
motifs delineated below are summarized in Table I. The HLA class I motifs set out in 
Table 1(a) are those most particularly relevant to the invention claimed here. Primary and 
10 secondary anchor positions are summarized in Table II. Allele-specific HLA molecules 
that comprise HLA class I supertype families are listed in Table VI. In some cases, 
peptide epitopes may be listed in both a motif and a supermotif Table. The relationship 
of a particular motif and respective supermotif is indicated in the description of the 
individual motifs. 

15 

IV.D.1. HLA-A1 supermotif 

The HLA-A1 supermotif is characterized by the presence in peptide ligands of a 
small (T or S) or hydrophobic (L, I, V, or M) primary anchor residue in position 2, and an 
aromatic (Y, F, or W) primary anchor residue at the C-terminal position of the epitope. 

20 The corresponding family of HLA molecules that bind to the Al supermotif (i.e., the 
HLA-A1 supertype) is comprised of at least A*0101, A*2601, A*2602, A*2501, and 
A*3201 (see